Error Control for Support Vector Machines
نویسندگان
چکیده
Error Control for Support Vector Machines by Mark A. Davenport In binary classification there are two types of errors, and in many applications these may have very different costs. We consider two learning frameworks that address this issue: minimax classification, where we seek to minimize the maximum of the false alarm and miss rates, and Neyman-Pearson (NP) classification, where we seek to minimize the miss rate while ensuring the false alarm rate is less than a specified level α. We show that our approach, based on cost-sensitive support vector machines, significantly outperforms methods typically used in practice. Our results also illustrate the importance of heuristics for improving the accuracy of error rate estimation in this setting. We then reduce anomaly detection to NP classification by considering a second class of points, allowing us to estimate minimum volume sets using algorithms for NP classification. Comparing this approach with traditional one-class methods, we find that our approach has several advantages. Acknowledgements The work presented in this thesis was accomplished with a great deal of help from Clay Scott. I greatly appreciate his assistance and insight, and especially his occasional prodding. Thanks also to my advisor, Richard Baraniuk, for his enthusiastic encouragement, and to the additional members of my committee, Don Johnson and Rudolf Riedi, for their helpful feedback. I am also deeply indebted to Ryan King and Brandon Skeen who helped provide the computational resources without which this work would not have been possible. Finally, I would like to thank all of my other friends and family for their support. Thanks especially to my parents for (perhaps unwittingly) pointing me down this road, and to Kim for her constant reassurance, optimism, and encouragement.
منابع مشابه
کاربرد الگوریتمهای دادهکاوی در تفکیک منابع رسوبی حوزۀ آبخیز نوده گناباد
Introduction: Reduction of sediment supply requires the implementation of soil conservation and sediment control programs in the form of watershed management plans. Sediment control programs require identifying the relative importance of sediment sources, their quantitative ascription and identification of critical areas within the watersheds. The sediment source ascription is involves two...
متن کاملA prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)
Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...
متن کاملA prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)
Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...
متن کاملIdentification and Adaptive Position and Speed Control of Permanent Magnet DC Motor with Dead Zone Characteristics Based on Support Vector Machines
In this paper a new type of neural networks known as Least Squares Support Vector Machines which gained a huge fame during the recent years for identification of nonlinear systems has been used to identify DC motor with nonlinear dead zone characteristics. The identified system after linearization in each time span, in an online manner provide the model data for Model Predictive Controller of p...
متن کاملA Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels
The limiting velocity in open channels to prevent long-term sedimentation is predicted in this paper using a powerful soft computing technique known as Extreme Learning Machines (ELM). The ELM is a single Layer Feed-forward Neural Network (SLFNN) with a high level of training speed. The dimensionless parameter of limiting velocity which is known as the densimetric Froude number (Fr) is predicte...
متن کاملEvaluation of the Efficiency of Linear and Nonlinear Models in Predicting Monthly Rainfall (Case Study: Hamedan Province)
In this research, we used the support vector machine (SVM), support vector machine combine with wavelet transform (W-SVM), ARMAX and ARIMA models to predict the monthly values of precipitation. The study considers monthly time series data for precipitation stations located in Hamedan province during a 25-year period (1998-2016). The 25-year simulation period was divided into 17 years for t...
متن کامل